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Abstract 



OO 

o 

O 

rsQ ' We provide a simple explicit estimator for discretely observed Barndorff-Nielsen and 

Shephard models, prove rigorously consistency and asymptotic normality based on the single 
assumption that all moments of the stationary distribution of the variance process are finite, 
and give explicit expressions for the asymptotic covariance matrix. 
^vj . We develop in detail the martingale estimating function approach for a bivariate model, 

^SJ ' that is not a diffusion, but admits jumps. We do not use ergodicity arguments. 

We assume that both, logarithmic returns and instantaneous variance are observed on a 
discrete grid of fixed width, and the observation horizon tends to infinity. As the instanta- 
neous variance is not observable in practice, our results cannot be applied immediately. Our 
purpose is to provide a theoretical analysis as a starting point and benchmark for further 
(~| ■ developments concerning optimal martingale estimating functions, and for theoretical and 

f r^ ' empirical investigations, that replace the variance process with a substitute, such as number 

I , or volume of trades or implied variance from option data. 

KEYWORDS: 

^ ' Martingale estimating functions, stochastic volatility models with jumps, consistency and asymp- 

0^ , totic normality 

^ ! 1 Introduction 

^D I In |BNS01| Barndorff-Nielsen and Shephard introduced a class of stochastic volatility models in 

continuous time, where the instantaneous variance follows an Ornstein-Uhlenbeck type process 

driven by an increasing Levy process. Those models allow flexible modelling, capture many 

stylized facts of financial time series, and yet are of great analytical tractability. For further 

k>( \ information see also [BNNS02] . BNS-models, as we will call them from now on, are afRne models 

j_j ■ in the sense of |DPSOO| and [DFS03J . where the associated Riccati type equations can be solved 

C^ I up to quadrature in general. In several concrete cases the integration can be performed explicitly 

in closed form in terms of elementary functions, see |NV03| and [VenOlj . 

BNS-models have been studied from various points of view in mathematical finance and 
related fields. In [NV03| option pricing and structure preserving martingale measures are studied. 
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In [BK05[ 'BMB05, ' BG05[ IRS06J the minimal entropy martingale measure is investigated. The 
papers [BKR03. ,Lin^ address the portfolio optimization problem. Bayesian/MCMC/computer 
intensive estimation is already in the seminal paper |BNS01j , and in the works |RPD04l IGSOl) 
IFSS01|,ltH03| . The papers [Jam05irjam06| exploit the analytical tractability to develop maximum 
likelihood estimation using the results of [CMOOi ICR90J for Dirichlct processes. BNS models are 
also treated in the textbooks [CT04[ ISch03| . 

Unfortunately, it seems that statistical estimation of the model is the most difficult problem, 
and most of the work in that area is focused on computationally intensive methods. 

The contributions of the present paper are as follows: first we develop a simple and explicit 
estimator for BNS models. Secondly, we give rigorous proofs of its consistency and asymptotic 
normality. In doing so we compute explicitly the asymptotic covariance matrix and develop to 
that purpose formulas for arbitrary bivariate integer moments of returns and variance. Thirdly we 
provide a detailed application of the theory of martingale estimating functions in a non-diffusion 
setting, including numerical illustrations. 

The literature on estimation for discretely observed diffusions is vast, a few references are 
[Uch04bl ITfch04ia rD504l IMHMl [EF02l Ili^^Ol [Ji^^OTl ^^U^ IHgoTl iKesOOl lKS99al [g0?97[ IES95] . 
In particular, the martingale estimating function approach is used, developed and studied for 
example in |S099| , |S0OO| , |S097| . In the diffusion setting the major difficulty is that the transition 
probabilities are not known and are difficult to compute. In contrast to that, the characteristic 
function of the transition probability is known in closed form for many BNS models and the 
transition probability can be computed with Fourier methods with high precision. Yet the model 
exhibits other peculiarities, see the remarks in section [2.31 

In the present paper we explore the joint distribution of logarithmic returns X and the 
instantaneous variance V supposing that both processes can be observed in discrete time. Since 
the joint conditional moment-generating function of {X, V) is known in closed form we obtain 
closed form expressions for the joint conditional moments up to any desired order which yields 
a sequence of martingale differences. We employ then the large sample properties, in particular 
the strong law of large numbers for martingales and martingale central limit theorem. In this 
way we do not need ergodicity, mixing conditions, ctc|^ 

The remainder of the paper is organized as follows: in section lTTJ we describe the class of BNS 
models in continuous time and present two concrete examples, the F— OU and IG-OU model. In 
section 12.21 we introduce the quantities observed in discrete time that are used for estimation. 
Section 12.31 contains some remarks of particular features of the model and its estimation. In 
section [3] we present the estimating equations, their explicit solution which is our estimator and 
prove its consistency and asymptotic normality. In section U we present numerical illustrations. 
In section [5] we sketch further and alternative developments, in particular considering the issue 
that volatility is typically not observed in discrete time. Explicit moment calculations of any 
order can be found in Appendix \X[ In Appendix [B] we provide for the reader's convenience a 
simple multivariate martingale central limit theorem. 



^Let us mention though, that the martingale strong law and the ergodic theorem have similar proofs and can 
be derived from a common source, |Rao73) . 



2 The model 

2.1 The continuous time model 

2.1.1 The general setting 

As in Barndorff-Nielsen and Shepard |BNS01j . we assume that the price process of an asset S 
is defined on some filtered probability space (ri,jF, (J^t)t>o,P) and is given by St = S'oexp(Xt) 
with 5*0 > a constant. The process of logarithmic returns X and the instantaneous variance 
process V satisiy 



dX{t) = (^ + pV{t-))dt + y^V{t-)dWg{t) + pdZxit), X{0) = 0. (f ) 

and 

dV{t) = -XV{t-)dt + dZxit), V{0) = Vo, (2) 

where the parameters /i, f3, p and A are real constants with A > 0. The process VF is a standard 
Brownian motion, the process Z is an increasing Levy process, and we define Z\{t) = Z{Xt) for 
notational simplicity. Adopting the terminology introduced by Barndorff-Nielsen and Shepard, 
we will refer to Z as the background driving Levy process (BDLP). The Brownian motion W and 
the BDLP Z are independent and {Tt) is assumed to be the usual augmentation of the filtration 
generated by the pair {W,Z\). The random variable Vq has a self-decomposable distribution 
corresponding to the BDLP such that the process V is strictly stationary and 

E[Vo] = C, Var[-l/o] = v (3) 

To shorten the notation we introduce the parameter vector 

0=(A,C,ry,/i,/3,p)^, (4) 

and the bivariate process 

X^{X,V). (5) 

If the distribution of Vq is from a particular class D then X is called a BNS-D0U(6') model. 
The process {Xt,Vt)t>Q is clearly Markovian. 

2.1.2 The r-OU model 

The P-OU model is obtained by constructing the BNS-model with stationary gamma distribution, 
Vq ~ r(i/, a), where the parameters are v > and a > 0. The corresponding background driving 
Levy process Z is a compound Poisson processes with intensity v and jumps from the exponential 
distribution with parameter a. Consequently both processes Z and V have a finite number of 
jumps in any finite time interval. 

For the F-OU model it is more convenient to work with the parameters v and a. The 
connection to the generic parameters used in our general development is given by 

( = -, ,--,■ (6) 

As the gamma distribution admits exponential moments we have integer moments of all orders 
and our Assumption [T] below is satisfied. 



2.1.3 The IG-OU model 

The IG-OU model is obtained by constructing the BNS-model with stationary inverse Gaussian 
distribution, Vq '^ (<5, 7), with parameters S > and 7 > 0. 

The corresponding background driving Levy process is the sum of an IG((5/2, 7) process and 
an independent compound Poisson process with intensity 5j/2 and jumps from an r(l/2,7^/2) 
distribution. Consequently both processes Z and V have infinitely many jumps in any finite time 
interval. 

For the IG-OU model it is more convenient to work with the parameters S and 7. The 
connection to the generic parameters used in our general development is given by 

As the inverse Gaussian distribution admits exponential moments we have integer moments of 
all orders and our Assumption [1] below is satisfied. 

2.2 Discrete observations 

We observe returns and the variance process on a discrete grid of points in time, 

= to < <i < . . . < i„. (8) 

This implies 

U(i,) = U(i,_i)e-^(*'-*-i) + / " e-^(*'-^)dZA(s). (9) 

Using 

y, := V{U), Ur.= r e-^^'^-'UZx{s) (10) 

we have that {Ui)i>i is a sequence of independent random variables, and it is independent of Vq. 
If the grid is equidistant, then {Ui)i>i are iid. Observing the returns X on the grid we have 



X{U) - X(i,_i) = ^i{U - U^i) + (i{Y{U) - Y{U^i)) 



+ / ^V{s-)dW{s) + p{Zx{U) - Zx{U-i)), 



where 

ft 



(11) 



Y{t) ^ / V{s-)ds 
Jo 

is the integrated variance process. This suggests introducing the discrete time quantities 

x,^x{u)~x{u^i), y, = r(tO-i"(t.-i), z, = z^iu) - Zxiu^i) (12) 

and 



W, = -^ r ^V{s-)dW{s). 



(13) 



Furthermore, it is also convenient to introduce the discrete quantity 



S^ ^\iZ,- Ui). (14) 



It is not difficult to see (conditioning!) that {Wi)i>i is an iid -A^ (0, 1) sequence independent from 
all other discrete quantities. We note also that (Ui, Zi)i-y\ is a bivariate iid sequence, but Vi and 
Zi are obviously dependent. 

From now on, for notational simplicity, we consider the equidistant grid with 



where A > is fixed. This implies 

and 

where 



tfe = A:A, (15) 

y^ = iV.-i + U, (16) 

Y^ = eV.-i + S,, (17) 

7 = e-^^, e=l^. (18) 

Furthermore, 

X,^^i^ + I3Y, + ^/y,W, + pZ,. (19) 

The sequence (Xi,Vi)i>o is clearly Markovian. From now on we assume all moments of the 
stationary distribution of Vb exist. 

Assumption 1 

E[Vi'] < oo Vn e N. (20) 

In the estimating context we assume all moments are finite with respect to all probability mea- 
sures Pg,9 Cz under consideration, where O is the parameter space. 

No other assumptions are made, and all conditions required for consistency and asymptotic 
normality of our estimator will be proven rigorously from that assumption. 

Proposition 1 We have for all n E ^ that 

E[Z1] < cx), E[U^] < cx), E[S^] < cx), (21) 

and 

E[Y^] < oo, E[W^] < oo, E[X^] < oo. (22) 

Consequently the expectation of any (multivariate) polynomial in Zi,Ui, Si,\/Yi,Wi, Xi exists 
under Pg . 

Proof: We will use repeatedly the well-known relation between the existence of moments and the 
differentiability of the characteristic function of a random variable, see |CT971 Theorem 8.4.1, 
p.295f], for example. 

Let (f){t) denote the characteristic function of Vq. By assumption _Ee[V^"] < oo for all n e N. 
Thus (f){t) is arbitrarily many times differentiable. The law of Vb is self-decomposable, thus 
infinitely divisible and 0(t) ^ for all i G R. Thus the Fourier cumulant function K[t) = 
log(/)(t) is arbitrarily many times differentiable. It follows from BNSOl, equation (12)], that 
the characteristic function of Z{1) is ili{t) ~ exp(fK'(t)). Thus i^{t) is arbitrarily many times 
differentiable and consequently E[Z{1)'^] < oo, for all n e N. As Z is a Levy process this implies 
E[Z{\Y] < oo, and as Zi = Z{X) we have shown E[Zi] < oo, for all n €\!4. 

From Uni) and ^ we have Ui < Zi and Si < \-^Zi so ^[C/^"] < oo and E[S'^] < oo for 
all n e N. As Wi has a standard normal distribution it follows trivially £'[W"] < oo for all 
n e N. Repeated application of the binomial resp. multinomial theorem, the Holder and the 
Cauchy-Schwarz inequalities yields £^[^1"] < 00 and £^[A"] < 00 for all n e N, and the final 
conclusion for polynomials. □ 

Let us remark that, by the stationarity, the above result holds also for Zi,Ui, Si, \fYi, Wi, Xi 
instead of Z\,U\,S\, \/Y\, W\,X\, where i e N is arbitrary. 



2.3 Some remarks 



Most work on estimating functions is developed for diffusions, see for example |S097[ IUch04bl 
IUchn4al[DS04llMRn3LlKP(M[ndil[ndiTl [S^0ll [BS^ although it 

is often remarked that the results extend to Markov chains. Yet the models under consideration 
here display several peculiarities. 

One assumption that is usually made is that the transition probabilities under Pg have the 
same support for each 0. Typically the support of the conditional distribution of Vi in a BNS 
model given Vq = v \s {ve~ ,-\-oo) under Pg, thus depends on 0. This does not affect our 
analysis. The experiment is not homogeneous, cf. [Str85] . 

If the BDLP is a compound Poisson process, as in the F— OU case, we have the atom of 
the conditional distribution of Vi given Vo = v under Pq at the parameter dependent position 
ve~ . Consequently no dominating measure exists and maximum likelihood cannot be defined 
in the usual way. There is an alternative definition covering that case, cf. jKW56| [Joh78j . but 
we have not exploited that direction further. See also 'N S03I . This problem does not appear 
with an infinite activity BDLP such as in the IG-OU model and standard maximum likelihood 
estimation could be studied. 

The description given in sections 12.11 and 12.21 provides a BNS model for each 9, but not a 
statistical experiment as it is taken as a starting point in section |31 The reason is that the 
processes X and V will depend on 9. This can be avoided by introducing a statistical experiment 
generated by a BNS model. In analogy to the statistical experiment generated by a diffusion, 
see |SSOO| . This means we take the distribution oi X and V on the Skorohod space (D^,;B(D^)) 
under each Pg as a starting point. 



3 The simple explicit estimator 

3.1 The simple estimating equations and their exphcit solution 

For estimation purposes we consider a probability space on which a parameterized family of 
probability measures is given: 

{n,T,{Pg:9eQ}), (23) 

where Q = {9 e R^ : 9^ > 0,9'^ > 0,9^ > 0}. The data is generated under the true probability 
measure Pg^ with some 9o £ Q. The expectation with respect to Pg is denoted by Eg[.] and with 
respect to P^o simply by E[.]. 

We assume there is a process X that is BNS-D0U(6') under Pg. We want to find an estimator 
for 9o using observations Xi, . . . , X„, Vi, . . . ,Vn. We are interested in asymptotics as n ^ oo. 
To that purpose let us consider the following martingale estimating functions: 



)]. 



GUo)^E''U[Vk~fHv,^u0)], 

Gm = j:'LAyk-PiVk-ue)], 

Gt{0)^ELAXk~fHVk-ue)], 

Gm^ELi[XkVk-i-.fHVk-u9)] 
Gli9)^El=i[XkVk-f%V,.^,9)], 



f'{v,9) 


^Eg[Vi\Vo^v] 


P{v,9) 


^ Eg[ViVa\Va ^ v] 


p{v,9) 


= Eg[VAVn=v] 


f\v,9) 


^Eg[Xi\Va^v] 


fHv,0) 


= Eg[XiVo\Vo = V 


p{v,9) 


^ Eg[XiVi\VQ ^ V 



(24) 



Lemma 1 We have the explicit expressions 

Piv, 6) = 72^2 + 27(1 - 7)C« + (1 - 7)2^2 + (1 - ^2),^ 

f{v,6)=Pev + ii^ + l3{\-e)C + pK ^ ' 

f{v, e) = Pev^ + (/iA + /3{1 - e)C + pXQv 

f{v, 6) = PeW + ((mA + /3(1 - e)C + pXOl + /5e(l - l)Ov + (1 - e)(l - 7)C' + e'A77 

Proof: The formulas are special cases of the general raoment calculations given in Appendix \^ 
In order to demonstrate the basic idea we will prove the statements for two special and simple 
cases here, namely for f^{v,6) and f'^{v,d). From ([TH|) it follows that 

Eg[Vi\Vo = v] = jv + Eg[Ui] (26) 

and from the stationarity of V we have 

Ee[Ui]^{l~j)Eg{Vo) = {l-j)C- (27) 

Furthermore, from (fT^ and the fact that i?[VKi] = 0, it follows that 

Eg[Xi\Vo =v]=pA + (3Eg[Yi\Vo ^ v] + pEg[Zi\Vo - v]. (28) 

But, from (fT7|) we have that 

Ee[Yi\Vo = v]^ev + \Ee[Zi - Ui] = ev + ((1 - e), (29) 

A 

and 

Eg[Z{\ = AC. (30) 

So, from (gSl) it follows that 

Ee[Xi\Va =v]=pJ\ + /Sev + /3(1 - e)C + pAC- (31) 

D 
The estimator 9„ is obtained by solving the estimating equation Gn{0) = and it turns out 
that this equation has a simple explicit solution. 

Proposition 2 The estimating equation Gn{On) = admits for every n > 2 on the event 

Cn = {a - ?^n > 0, «^ - [vlf > 0} (32) 

a unique solution 0„ — (XmCrnVn, Pm Pm Pn) that is given by 

in = {en-envi)/{vi-Kn, 

Cn = {ill - Invi) I {I ~ In)] 

rin = ((a - {il?) - llivl - {vif))l{l - ll); 
A„ = -log(7„)/A; 
e„ = (1 -7n)/An; 

Pn^{en-vUt)/{er.{vl~{vlf))- 

Pn = (^ - iXi - /9nen(r]„(l " In) + InK - K)^))) / {2{l - 7„)77„); 

M« = (Cn - PneniVn - Cn))/A - (/3„ + XnP,i)Cn; 



where 



en = iEv. en = ^Ev.v.-i en = iEK' 



i=l 



(34) 



in — n ^ -^i in — n^ ^i^'.-l ^.n — n ^ -^^^i 



vi^iEV^-l vl = ^j:vl, (35) 

i=l i=l 

Proof: The first three equations G^„{6) = 0, for j = 1,2,3 contain only the unknowns CiVi'^ 
and are easily solved. In fact we get a familiar estimator for the first two moments and the 
autocorrelation coefficient of an AR(1) process. The last three equations G^(0) = 0, for j = 4, 5, 6 
can be seen as a linear system for the unknowns /x, /3, p, once the other parameters have been 
determined. D 

Remark 1 The exceptional set C„ could be simplified to 

C'n = {C - iWn > 0} (36) 

Since the jump tim,es and the jump sizes of the BDLP are independent, and the former have 
an exponential distribution it follows that Vo,...,V^ is with probability one not constant, so 
P[v'^ — (f^)^ > 0] = 1. But although it can be shown that the probability of Cn tends to zero, for 
finite n we have P[£,n — Cl^n !i 0] > 0. This is the common phenomenon that sample moments 
do not share all properties of their theoretical counterparts. For definiteness we put 6n — 
outside Cn- 

3.2 Consistency 

Let us investigate the consistency of the estimator from the previous section. First, we will need 
the following lemma. 

Lemma 2 For every fc > 1 and p > 

^^^0 as n^oo. (37) 

Proof: The random variables {14,, n > l} are identically distributed and m^ — E[\Vi\] < oo for 
all fc > 1. Thus we are in the situation of [Sto74i Exercise 2.1.2(i), p. 14]. 

Let fc > 1 and e > be arbitrarily chosen. Taking any integer a > 1/p and using the 
Chebyshev inequality we obtain 

Vpf^ >,)<V^^^<V^^<oo. (38) 



nP 



71—1 n— 1 



Therefore from the Borel-Cantelli lemma it follows that P(limsup„ n P\Vn\ > e) = 0- 
Lemma 3 We have for all k E \!4 that 



1 " 

-Y.V," ^ E[V,% (39) 



n 

i=l 



as n —> oo . 



Proof: We will prove this statement by induction. 
(1) k = 1. Let us define 

H, = V,-E{V,\V,-i), i>l. 

Obviously, {Hi, i > 1) is a sequence of martingale differences and is therefore uncorrelated. 
Using expressions ([T0|) and (fT6|) we obtain 

E[Hf] - EiV;") ~ E[E{V,\V,-i)^] 

= (1 - 7')i^(K,') - 2^EiUi)E{Vo) + EiUl), 

so ii^[-ffj] have a common bound for every i > 1. Since the assumptions of the Theorem 5.1.2 
from IChu01| are satisfied, it follows that 

-y,V,--y,E{V,\V,^{)^^Q, as n^oo. 

But using again definition p6p . the last expression is equivalent to 



n 



Ki H E[Ui) — > 0, as n ^ oo. 



as n — !■ CXI. 

i=l 



Finally, using the result of the previous lemma and the fact that E[Ui] = (1 — 7)i?[Vb], it follows 

n 

-Y.V,^E{Vo), 

i=l 

This completes the proof for fc = 1 . 

(2) Suppose now that the statement of the theorem holds for Z < fc — 1, i.e. E{Vi ~ ) < oo and 

1 " 

-Y^V!^E{Vi), l<k-l (40) 

1=1 
when n —^ oo. For fc > 1, and for i > I, let 

n 

ijf :-y^-i?[y^|y._i] and 5^=^i/^ (41) 

i=l 

Obviously, (i^^", i > 1) is a sequence of martingale differences and in particular is uncorrelated. 
Moreover, due to the strong stationarity of the volatility sequence and relations (PT|) and (fTBl) 
we obtain 

E[H^]^ = E[V^'' - 2V^E[V^\V^-i] + E[V^\V,-i]^] 
^ E[V^']^E[{E[V^\V,^^])'] 
< E[V^''] ^■. Ck, (42) 

Cfc denoting some constant that does not depend on i. Hence, by |Chu01[ Theorem 5.1.2, p. 108], 

S'' as 

it follows that —^ ~^ when n — > oo, which in our case, due to the definition of S',, is equivalent 

n 
to 

^ n 1 ^ 

-T.V^--T.E[y'\y^-^]'^^- (43) 

i=l i=l 



Using again the definition (fTHj) and the independency of Ui from Vi-i, for z > 1, we obtain 



71 k 



'-T.y^'~'-T.m'\y^-A - ^E^/-^EE(-)7^e.i^K-] 



n '■ — ' 71 '■ — ' n '■ — ' n '■ — ' '■ — ' \ 7 

i=\ i=\ i=\ i=\ j=o ^-^ 

1=1 j=0 ^-^^ 4=1 

Finally, applying the assumption of the induction, Lemma [2] and the statement (|43p , we obtain 

^ E ^^'^ - T^ E (%'E[ur]EiV^) - EiV,% 
" j=l ^ ' j=0 ^^^ 

where the last equality follows calculating E{'jVq + Ui)'' using (fT6|) . □ 

In the next lemma we extend the strong law of large numbers for {Vf , i > 1) to more general 
sequences. 

Lemma 4 For all integers p,q,r > we have 

1 " 

-J2x!v:'V^r ^ E[X^,V,^V,^] (44) 



n 

i=l 



as n —> 00. 
Proof: : Let 



M, = X^V:^\ri, - E[XfV:^V;_,\V,.^] . 



Obviously, {Mi, i > 1) is a sequence of martingale differences, and in particular it is uncorrelated. 
It is stationary and i?[Aff] < cx3. So we can use again |ChuOH Theorem 5.12] to show 

LY^xfv^vu - -Y.E[x!v^^vu\v^-l] ^ 0. 

2—1 2—1 

The conditional expectation £^[Xf V^^T/^^]^|Vi_i] is a polynomial in V^-i, namely 

V-\-q 



fc=0 



This is shown in section A. 5 in Appendix VK\ where the coefficients 4>pqk are explicitly calculated. 
Applying Lemma [3] yields 



-1 '^ y i H 

i=l fc=0 

As we have 

p+q 

E[XIV,^V^] = E[E[XlV,^V^\Vo\] = ^ ^pqkE[Vtl. (45) 

fc=0 

the proof is completed. □ 



10 



Theorem 1 We have P{Cn) — > 1 when n — > cxd and the estimator On is consistent on Cn, 
namely 

^n > f 

on Cn as n —f oo. 

Proof: Using the results of Lemma [3] it easily follows that 

en-in^n^Cov{V^,Vo)>0, (46) 

SO P{Cn) ^- 1 as n ^ cx). 

Using again the results of Lemma [3] it follows that the empirical moments in (IM)) and (15^ 
converge to their theoretical counterparts, £,„ ~~* C and v^ — — » v^, where 

e^ = //C + (/3 + Ap)C' + (3ev, 
e^tiC+if3 + Xp)e + {P + 2p\)ev, 

Plugging the limits into (|33p shows, after a short mechanical calculation, that the estimator is 
in fact consistent. D 

3.3 Asymptotic normality 

For a concise vector notation we introduce 

Sfc = {Vk,VkVk-uV^,Xk,XkVk-i,XkVkf, (48) 

and write the estimating equations in the form 

n 

GlM = J2H-f{Vk-i,e)], . = 1,...,6 (49) 

k=l 

and p{v,6) given by (PS)) . We write 

Pi+rt+qi 



with 



and 



p= (0,0,0,1,1,1), 9^(1,1,1,0,0,1), r- (0,1,0,0,1,0) (51) 



</>K^)=7 <AJ(0) = (1 - 7)C 

ci^m^i' 03(0) = 27(1 -7)c cpm^i^-ire + {^-i')v 

^t{9)^Pe q^t{e)=^i + p{l~e)C + pXC 
cj^m^P^ (/.5(0)=^ + /3(l-e)C + pAC 

cj>l{e) = /3e7 fiie) = {{p^ + /3(1 - e)C + pAC)7 + /3e(l - 7)0 '/'[5(^) = (1 - e){l - 7)C' + ^'At? 

(52) 
We will use, that p{v,9) is a polynomial in v, and that its coefficients (p are smooth fmictions 
in 61. 

We shall first prove the central limit theorem for the estimating functions. 

11 



Proposition 3 We have 

-^Gn{9o) ^ N{0,T), (53) 

as n ^ oo, where 

T,,=E[Cov{E\,Ei\Vo)]. (54) 

Proof: To show the above result, we use the multivariate martingale central limit theorem, that 
is recapitulated in Appendix [B] To that purpose we introduce the vector martingale difference 
array 

X„,fc = ^[SJ,-r(Ffe_i,0)]. (55) 

We have to show the two assumptions from the previous theorem. First, we prove a multivariate 
Lyapuonov condition which implies the Lindeberg condition. From (j55p it follows that v^X„ i is 
of the form p(Vo, Vi, Xi) where p{vo, wi, a;i) is a polynomial in wqj ^i: xi which does not depend 
on n. Thus, n ||x„,fc|| has the same property and from the explicit moment expression from 
Appendix El it follows that 

E[\\xn.k\\^\^k-i] ^^q{Vk-i), (56) 

where q{vo) is a polynomial in vq. From Lemma[3]it thus follows 

1 " 

-Y,q{V,^,) ^ E[q{V,)], (57) 

fc=i 

where the expression on the righthand side exists and is finite. Thus the first condition of the 
martingale central limit theorem is satisfied. In order to verify the second condition of the same 
theorem we consider the (i, j)— th element of the matrix Xn,kXn k "which is given by 

^{El^ fiVk-uO)){Ei~ r{Vk-i,e)). (58) 

This is again a polynomial in T4_i, Vfc and Xk so by Lemma H] it follows that 

1 " 

-J2H- fiVk-i,e)){Ei~ fiVk-uO)) ^ E[{El- f{Vk-i,e)){Ei~ fiVk-uO))] (59) 

" fc=l 
as n ^- oo. □ 

Remark 2 A systematic method to evaluate T is given in AvvendixV^ and the resulting explicit 
expressions are given in \Pos07^ . 

Lemma 5 We have 

^[Cn-C] ^A^(0,S), (60) 

where 

I] = p-iT(p-i)^ (61) 

and 

Pij = Sij - (l)\6ij - (jylSsj (62) 

with dij denoting the Kronecker delta. 
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Proof: We can write 

-^Gni9o)=PV^{^n-0+Qn, (63) 

with 

Qn = -^ [</>! {Vn ~Vo)+ 4>\ (K? - ^o')] ■ (64) 

In view of Lemma [5] above we see, that the remainder term Q„ goes to zero in probabihty as 
n —^ cxD. As P has determinant (1 — 7)^(1 + 7) > it is invertible, and we have 

V^{^n -0= P-^ (-^Gn{ea)\ + Rn (65) 

with Rn = —P~^Qn going to zero in probability as n ^ oo. The expression P~^ (n~^/^Gn{9o)) 
is asymptotically normal with mean and covariance matrix S. An application of Slutsky's 
Theorem proves the lemma. □ 

Finally, we have all the ingredients for proving the following result. 

Theorem 2 The estimator 

Gn = ( A„ , Cn , Vn , ^n , /?« , Pn ) (66) 

is asymptotically norm,al, nam,ely 

V^[k-e^]^N{Q,T), (67) 

as n —t oo, where 

T = DT.D^ (68) 

and D is explicitly given according to i69\) . 

Proof: We observe from ([55|) that 0„ = 9{(.n,i->n), where g is well defined and continuously 
differentiable in a neighborhood of ($,u). Using the Taylor expansion in the last two variables 
we have 9„ — h{S,n) + Sn, where h is well defined and continuously differentiable in a neighborhood 
of ^, and Sn goes to zero in probability in view of Lemma [21 Thus it can be neglected according 
to Slutsky's Theorem. We apply the delta method, see [Leh99| for example, and compute the 
Jacobian matrix D with 

A, = 1^(0, *,j = l,...,6. (69) 

A lengthy elementary calculation shows that the matrix has determinant A/ (2(1 — 7)^7?/''), thus 
it is invertible. □ 

Remark 3 For comparison it is instructive to study our simple estimator in the general frame- 
work of \S099^ where the properties of the estimator are studied without exploiting the fact that 
the estimating equation allows an explicit solution. This is done in ^PosOTj. There the theory 
is extended in the case of a bivariate Markov process and Condition 2. 6 of \S099]I is proven in 
order to use his Corollary 2. 7 and Theorem 2. 8. 
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4 Numerical illustrations 

4.1 Description of the model and its parameter values 

To illustrate the results from the previous sections numerically, we consider the F-OU model 
from Section I2.1.2[ where the variance V has a stationary gamma distribution. We use as time 
unit one year consisting of 250 trading days. The true parameters are 



2.56, 



64, A = 256, f3 



-0.5, 



-0.1, ^==1.2. 



(70) 



The parameters imply that there are on average 2.6 jumps per day and the jumps in the BDLP 
and in the volatility are exponentially distributed with mean 0.0156. The interpretation is, that 
typically every day two or three new pieces of information arrive and make the variance process 
jump. The stationary mean of the variance is 0.04. Hence, if we define instantaneous volatility 
to be the square root of the variance, it will fluctuate around 20% in our example. The half-life 
of the autocorrelation of returns is about half a day. 

In our example annual log returns have (unconditional) mean 25.6% and a annual volatility 
20%. Figure [T] displays a simulation of one year of daily observations from the background driv- 
ing Levy process, from the instantaneous variance process, and log returns, or more precisely, 
simulated realizations of Zi, Vi, and Xi ior i = 1, . . . , 250. In [Pos07| other scenarios are consid- 
ered, for example, small jumps arriving every minute, with fast decaying autocorrelation, or few 
jumps per year, corresponding to exceptional news with heavy impact on the variance process. 



4.2 The asymptotic covariance matrix of the estimator 

As our goal is an analysis of the estimator, and not an empirical study, we do not estimate the 
asymptotic covariance, but evaluate the explicit expression using the true parameters. Denoting 
the vector of asymptotic standard deviations of the estimates and the correlation matrix by sj -Jn 
resp. r we have 

4.86 

125 

650 u.«±i u.«± 1 u.uu u.z,i u (n^\ 

7 35 ' "^ ~ (\(\'X n OQ n OR i _n 7^; (\(\p. ^' > 

253 

0.526 



1 


0.89 


0.41 


0.03 


0.09 


-0.02 


0.89 


1 


0.4 


0.03 


0.09 


-0.03 


0.41 


0.4 


1 


0.06 


0.22 





0.03 


0.03 


0.06 


1 


-0.75 


0.06 


0.09 


0.09 


0.22 


-0.75 


1 


-0.57 



-0.02 -0.03 



0.06 



-0.57 



4.3 Distribution of the estimates 



Figure 14.31 illustrates the empirical and asymptotic distribution of the simple estimators for the 
F-OU model. The histograms are produced from m = 10000 replications consisting of n = 8000 
observations each, corresponding to 32 years with 250 daily obervations per year. We see from the 
graphs that in our illustration the parameters v^ a, A, and /i can be estimated quite accurately, 
in the sense that the usual confidence intervals yield one or two significant digits at least. The 
estimate for p is not as accurate and the accuracy for the estimate for (3 is unsatisfactory. 

The bad quality of the estimator for (3 is neither surprising nor very troublesome. It has 
little impact on the model. The main reason for including the parameter /3 in the specification 
of BNS models is, for derivatives pricing: A risk-neutral BNS-model must have (3 = —1/2. In 
most applications working under a physical probability measure (3 — can be assumed without 
much loss of generality or flexibility. 

In ongoing work [HP06| we compare this asymptotic covariance with the covariance of the 
optimal quadratic estimating function. 
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5 Further and alternative developments 

5.1 Optimal quadratic estimating functions 

Our choice of estimating functions is natural, but, mathematically speaking, somewhat arbitrary. 
In ongoing work [HP06J we show, that the optimal quadratic estimating function based on the 
moments of Vi, Xi, Vi, V^Xi^Xl can be computed explicitly, though the corresponding estimator 
has to be determined numerically. Our simple estimator can be used as a starting point for an 
iterative root-finding procedure. Consistency and asymptotic normality can be shown using the 
general theory as presented in |S099| along the lines of the present paper, although the expressions 
involved are slightly more complicated. 

5.2 Using more integer or trigonometric moments for better efficiency 

More efficient estimators than provided by the optimal quadratic estimating function can be 
obtained by incorporating further moments. As we have provided explicit computations for arbi- 
trary integer moments and conditional moments, our methods can be extended to that situation. 
We might even have the number of moments tend to infinity with the number of observations, 
and obtain an estimator that is asymptotically equivalent to the maximum likelihood estimator, 
when the latter exists resp. can be defined, see 12.31 The reader might object, that very high mo- 
ments are not reliable for empirical investigations. BNS-models allow also explicit computation 
of the characteristic function and thus of conditional and unconditional trigonometric moments 
£:[e*(?fcVi+V'feXi)] and £:[e*(«'=^i+'^'=-^i)|Vo] for arbitrary constants ^k and Vfe, that could be used 
instead for constructing estimating functions. See [AS02J for diffusions, |Sch05| for Levy type 
processes, and ^Sin01_ for afiine models. 

5.3 Intra-day observations 

Our approach is based on the explicit calculation of conditional and unconditional moments. 
Those calculations can be done for BNS-models on arbitrary time intervals. Hence our analysis 
is not restricted to a fixed time grid with the number of observation intervals tending to infinity, 
but could be performed also on a fixed horizon, with the number of intra-day observations 
increasing to infinity. The resulting estimators should then be compared to power-variation 
methods, cf. |Tod06| . 

5.4 Comparison to the generalized method of moments 

We would be interested in a comparison of our results to the related generalized methods of 
moments. For a rigorous treatment of the latter, a precise specification of the weighting matrix 
is required, see |HHY96| and the references therein. 

5.5 Unobserved volatility and substitutes for volatility 

Finally, perhaps the biggest issue is, that the instantaneous variance is not observed in discrete 
time. In Lin05j it is reported, that the number of trades is an excellent substitute for statistical 
purposes. This is certainly a promising starting point for an empirical analysis. For a theoretical 
analysis a joint model for the number prices and number of trades has to be specified. 

Another direction would be to adapt the implied state method (IS-GMM) as introduced in 
[Pan02j to our martingale estimating function approach: We replace the unobserved Vi in the 
estimating equations by the model-implied variance Vi{9) that is obtained from option prices, 
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assuming that the dynamics are governed by BNS-models both under the physical probabihty 
measure P^o ^'Hd ^ risk-neutral measure Pg . The resulting estimating function will not be a 
martingale estimating function any more, and the bias has to be accounted for in a rigorous 
analysis. Nevertheless, in view of the results of [Pan02| . we are optimistic, that consistency and 
asymptotic normality will hold also here. 

A Explicit moment calculations 

This section is about computing explicitly E[XiV[^\Vo = 0] and E[X]^V™]. All moments below 
will be given in terms of the cumulants of the stationary distribution, denoted by i^„. We set 

C = K,, 77 = K2. (72) 

If the stationary distribution is determined by the two parameters C and 77 the higher cumulants 
are obviously functions of C and 77, but the formulae hold in more general cases. 

The calculations exploit the analytical tractability of the BNS-model, namely conditional 
Gaussianity of the logarithmic returns X and the linear structure of the OU-type process V. 
From that it follows, and it is well-known, that univariate and multivariate cumulants can be 
computed easily. It remains to transform multivariate cumulants to multivariate moments, again 
a topic that is well-understood, and explicit expressions involve the multivariate Faa di Bruno 
formula, multivariate Bell polynomials and integer partitions, see for example [McC87] . 

We have chosen to use simple recursions, that are easy to implement on a computer algebra 
system, in particular, since the expressions, though completely explicit and elementary, are rather 
lengthy when it comes to evaluating moments of order four for the asymptotic covariance matrix. 
For the reader's convenience, we give the details in this appendix. 

A.l Preliminaries 

Let us recapitulate the variables and notation from section [2?2l that are required in the following 
calculations. We use 

7 = e-^^, e = ^^. (73) 



We have 
where 



A 

Vi = 7I/0 + Ui, Fi = eVo + Si (74) 



/■A nA 



Ui= f e-^(^-^)rfZA„ 5i=/ A-i(l-e-^(^-^))dZA,. (75) 

Jo Jo 

Note, that we have the simpler formula Si — {Zi — Ui)/X, but the integral above is sometimes 

notationally more convenient. We have 

Xi^Ai + y/YiWi, Ai=piA + (3Yi+pZi. (76) 

A. 2 Stationary moments 

We use the well-known recursion to compute moments from cumulants 

E[Vol = Sno + E " ■ )K^+iE[vr'-l- (77) 

i=o ^ * ^ 

Alternatively we have i^iVJ'] = Yn{Ki, . . . , Kn), where Yn{xi, . . . , Xn) denotes the complete Bell 
polynomials. Explicit non-recursive expressions can be given, but we do not use them. 

16 



A. 3 Trivariate cumulants 

From the key formula for Wiener-type integrals with Levy process integrator, it follows that the 
joint cumulants of (5*1, Ui, Zi) are given by 





Knmi = Ae„m(n4- 


m 


+ ^)^n+m+^, 


A" 


-i 


\ k=\ 


> 


-1) 


fel-7'^ 
fcA 


/ 


A" 


-i 


^ 1 - 7^' 


i 

-y 

k=\ 


,k) 


(-l)'^l 


-1^ 
fcA 



(78) 



ith 

' / ' / X 1 k\ 

J = 



j>0 



(79) 



A. 4 Trivariate Moments 

Trivariate moments can be computed recursively from trivariate cumulants 

i=0 j=0 fe=0 ^ / \J / \ / 

E[s^urzi] = E E E (f) (" 7 ') (l)K.,+i^kE[sr'ur'-'zt'] (si) 

i=0 j=0 fc=0 ^ ^ ^ ^ / \ / 

E[s^urz(] = E E E (7) (") (^ ^ ') if,,,.+ii?[^r-'c/r-^'^r^-l (82) 

i=0 j=0 k=0 V*/V-^/V / 

Alternatively, we can express i?[S'"C/™Zf] as trivariate complete Bell polynomials Ynme evaluated 
at the trivariate cumulants of Si,Ui, Zi, and explicit non-recursive expressions are available, but 
not very useful for us. 

A. 5 Some conditional expectations 

Using ([74|) gives 



^[yi"vr^f|Vo = «] = EE (7) ('^)^"i'E[s'^~'ur~'z',] ■ v'+^ 



(83) 



i=0 3=0 

Collecting powers of v gives 

n+m 



ElY^^VrzilVo ^v]=J2 inmikv" (84) 



with 

lA/c 



e»™.fe = E (fc ! J (7) ^'■'7^^[^r~'+-'t/r-^^f] (85) 
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i=0 j=o \ / \ -J 



Then using ((7H)) and conditioning gives 

E[A",YrV,'\Vo = «] = E E ( J ( " / )pV^i"-'-'E[Y^+'V,'Zi\Vo = v] (86) 

Collecting powers of v gives 

ElA'lYrV.'lVa = v] = E ^nmikv" (87) 



n+m+£ 



fc=0 

with 

n n— 2 

' n \ I n — I 



Ipnnrlk^ E E(j)( ' ) /^V7*" ' ^ 6n+j/,i,fe 



Finally using (|76p and the Gaussian moments gives 



,2iy 2«i! 

1=0 

Collecting powers of v gives 

n+ra 



L"/2J . \ ('o-V 



^[Xi"i/ri^o = ^'] = E '^""fc^' (90) 



k=0 

with 

(pnmk = 2^ \^. j-^-^Wn-2i,i,m,k (91) 

It follows from the calculations above that 4'nmk are polynomials in 7, e, /i, /3, p. 

A. 6 Some unconditional expectations 

The same structure pertains for the unconditional expectations, 

" ™ / \ / \ 

Eivrvrzi] = EE r r)e'7'E[sr'ur'zi]E[v^+^] (92) 

,;=0i=0 ^*^ ^-^^ 
then 

£;[A^y™t//] = E E " ) r • ) PYfi^-'-^EiYi^'+'vIzi] (93) 

i=oj=o Vv V J / 
and finally 

i?[xrFr]= E UJ 2S^[^""''^i^"]- (^^) 

i=0 ^ ^ 
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B The simple multivariate martingale central limit theo- 
rem 

The following simple version of a multivariate martingale central limit theorem is certainly well- 
known or obvious for experts, some references are [CPOS) lKS99b|, IvZOOj . 

However, when looking for references, we found statements that do not exactly apply, or that 
are much more general (continuous time, random normalizations,.. .). It turned out that the 
elementary proof below is shorter, than an attempt to verify the assumptions and deduce the 
result from a more 'advanced' theorem. Yet, any concrete and precise hint for an appropriate 
reference would be most welcome to the authors. 

Theorem 3 Suppose (Xn^k) is a martingale difference array such that for every e > 

n 

^S[||X„,fc|pl{||^„,^ll>,j|.Ffc_i] ^0 (95) 

fe=i 

and 

n 

J2 [Xn.kXl^\:Fk-i] -^ T (96) 

fc=i 
as n -^ oo. Then 

n 

^X„,fc^iV(0,T). (97) 

fe=i 

Proof: We will use the Cramer- Wold device. For /3 £ R , (3^0, let us define a random variable 

y„,fc = /3^X„,fc. (98) 

Then we have 

n n 

J2E[Yl,\Tk-i] = ^i?[(/3^X„.fc)(Xj;,/3)|.Ffe_i|] 

k=l k^l 

n 

= /3^^£;[X„^fcX^,|.Ffc_i]/3. (99) 

k=l 

From Assumption ([96|l it follows that the expression ([99|) converges to (3^T(3 and thus 

n 

Y,E[Yl^\Tk-i]^l3''Tfi (100) 

fc=i 

as n — > oo. Furthermore, it holds 

\l3^Xn,k\<\\P\\-\\Xn,kl (101) 

Thus, for an arbitrary e > we have 



< Y.^[^lkH\Y^M>^}\H-i] 

n 

= 2^-E[(/3^X,Kfc)^l{t/3Tx„,fc|>e}|-?^fe-l] 
fe=l 

n 

< ||/3|p^i?[||X„,fe||2l{|^Tx_^^|>,}|.Ffc_i]. (102) 



k=l 
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Since for /3 7^ the condition |/3^X„^fe| > e implies 

ii^n,fcii > ii/3ir^i/3^x„,fci > m\-h, (103) 

it follows that 

l{|/3^X,.,fcl>£} ^ l{||X.,fc||>ll/3||-ie} (104) 

and thus using the Assumption ([55]) . from p02p it follows that 

n 

^£;[y„%i{iy„,i>,}|^fc_i] ^0 (105) 

as n — )■ 00. Now, the statement follows fr'om the univariate martingale central limit theorem from 
[HH80] . D 

Lemma 6 T/ie conditional Lyapounov condition implies the conditional Lindeherg condition, 
namely, if 

n 

Y,E[\\Xr,,k\\^\Tk-i]^0 (106) 

fe=i 

then 

n 

5]S[||X„,fc|pl{||;,^^||>,}|^fc_l] -^0 (107) 

fe=l 

as n —f cxD. 

Proof: For every e > we have 



n 

> e2^ii;[||X„,fe|pl{||;f„,,||>,}|^,_i], (108) 



k=l fc=l fe=l 



fc=l 

since 



Y,E[\\X^M\%\\x^..\\<c}\^k-i]>0. 



k=l 



From Assumption p06p the statement follows. □ 
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Figure 1: Daily observations Zi,Vi,Xi. 
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Figure 2: Empirical and asymptotic distribution of the simple estimators for the F-OU model. 
The histograms are produced from m = 10000 replications consiting of n = 8000 observations 
each, corresponding to 32 years with 250 daily observations per year. The true values are 
V — 2.56, a = 64, A ~ 256, /i ~ 1.2, /3 = —0.5, p = —0.1. The standard deviations used for the 
normal curves are taken from the explicit asymptotic results, not estimated. 
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